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Abstract 

The most common stochastic volatihty models such as the Ornstein-Uhlenbeck (OU), 
the Heston, the exponential OU (ExpOU) and Hull- White models define volatility as a 
Markovian process. In this work we check of the applicability of the Markovian approxi- 
mation at separate times scales and will try to answer the question which of the stochastic 
volatility models indicated above is the most realistic. To this end we consider the volatil- 
ity at both short (a few days) and long (a few months)time scales as a Markovian process 
and estimate for it the coefficients of the Kramers-Moyal expansion using the data for 
Dow- Jones Index. It has been found that the empirical data allow to take only the first 
two coefficients of expansion to be non zero that define form of the volatility stochastic 
differential equation of Ito. It proved to be that for the long time scale the empirical data 
support the ExpOU model. At the short time scale the empirical model coincides with 
ExpOU model for the small volatility quantities only. 
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1. Introduction 



The stochastic volatihty (SV) models with continuous time have been introduced into 
hterature in late of 80-s of the last century [U El El H] . According to these models the 
market dynamics is the two-dimensional stochastic process in which the asset price St 
obeys the stochastic differential equation in the Ito form ( the index t is omitted for 
simplicity) 

dS = i^Sdt + aSdWiit), (1) 

where the parameter /i is the drift coefficient, Wi{t) is a standard Wiener process and at 
is the volatility considered stochastic variable. 

The empirical analysis have established two important stylized facts concerning with 
the volatility. Firstly, this process has a long memory emerging, in particular, in that the 
autocorrelation function absolute returns decay very slowly with time. One can separate, 
at least, two characteristic time scales in the behavior of the autocorrelation function. 
At the initial stage there is a fast decay on the short time scale of the order of few days 
followed by the slow decay during a few months, defining the long time scale. Secondly, 
there is the negative correlation between past returns change and future volatility (so- 
called "leverage" effect). 

At the present different SV models are discussed in literature. To a certain extent these 
models are based either on the model of the geometrical Brownian motion or originate from 
Ornstein-Uhlenbeck (OU) process. It is assumed that volatility is a function a = g{Y) 
of a stochastic process Yt and that the djTiamic equation for Yt can be represented as a 
stochastic differential equation in Ito form 

dY = a{m-Y)dt + f{Y)dW2{t), (2) 
a = g{Y) . 

Eq.([2]) defines the so-called class of mean-reverting processes in which Yt goes to mean 
value m at t ^ oo with the velocity a. The quantity l/a is the time of relaxation of a to its 
equilibrium value approximately equal to g{'m) and actually represents the characteristic 
time scale of the process. The Wiener process W2{t) in general is correlated with the 
process Wiit). 
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Depending on g and / one can distinguish basically four frequently used SV models. 
1) The Ornstein-Uhlenbeck (OU) model [H [2] with a = Y and f{y) = k, where /c is a 
positive constant and 



da = a{m-Y)dt + kdW2{t) (3) 

2) The exponential Ornstein-Uhlenbeck (ExpOU) model [1] with a = and f{y) = k > 0, 
where it is assumed that Y = lna follows to the OU process and variable a, as it is easy 
to show in this case, satisfies the equation 



d> 



a = a 



2 



k 

a(lna — m) 

2 ^ ^ 



dt + kadW2it) (4) 



3) The Heston model [3] where a = and f{y) = k^, {k > 0). In this model it is 
assumed that the volatility is the OU process of the form ([3]). 

4) The Hull-White model Il]with a = f{y) = ky, {k > 0)) and 



da = — 
2a 



aim — a 



dt+^dW2{t) (5) 
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Originally the parameters of the models 1) - 4) were being estimated by a fitting to the 
empirical data from options pricing. Lately the question about the applicability of one or 
the other stochastic volatility model for describing a time evolution of stock prices, market 
indices or exchange rates is actively discussed in the physical literature. The fundamental 
problem is finding the most realistic model and estimating its parameters. 

One of the approaches is that parameters of a model are estimated by fitting the 
theoretical probability distribution functions (PDF) of returns to the empirical curves. 
So the studies carried out in works [HI El [TJ [8] have showed that the Heston model well 
enough reproduce the empirical distributions for Dow - Jones Index and a number of 
stocks. On the other hand, in the case of the high-frequency data, the Heston model, as 
well as Hull- White model, applied to German DAX Index give the return distributions 
not conforming to tails of the empirical curves [9] (see also |10j). 

The alternative approach is to estimate the parameters of the above models in such a 
way as to reproduce other the market stylized facts. In particular, in works [HI [121 [131 [E] 
the parameters of the OU, Heston and ExpOU models have been estimated by comparison 
of the theoretical predictions to the observed leverage effect. It has been showed that 



these models qualitatively reproduce the observed effect as a result of the choice of the 
parameters, however, the empirical data do not allow to assert the most appropriate 
model. As regards the autocorrelation function, in contrast with other models, ExpOU 
qualitatively reproduces the behavior of the empirical curve at medium and long times by 
fitting the parameters. On the other hand in order to take into account the occurrence 
of two time scale in the work [IS] (see also [TB]) a three dimensional diffusion model, 
assuming that the mean reverting level m is random, has been introduced. 

Thus by fitting the parameters above SV models sometimes can reproduce well enough 
the probability densities of the returns or describe specific observed authoritativeness (the 
leverage effect, behavior of autocorrelation function). However the question of choice of 
most realistic stochastic volatility model still remains open. 

Models 1) - 4) determinate the volatility as a Markovian process. This follows from 
the well-known fact that solutions of a SDE of Ito have the Markovian properties [17j. 
At the same time the empirical volatility autocorrelation function, decaying very slowly 
with time, shows , in general, the non-Markovian behavior. Furthermore as it has been 
noted above the autocorrelation function has at least two characteristic time scales. In 
this connection it is worth noting that the empirical analysis reveals the presence of a 
well-separated time scales in the dynamics of the volatility itself. 

So, LeBaron in [TH] has showed that the SV model, where the volatility behavior at 
short (~ 1 day), medium (~ 5 weeks) and long (~ 5 years) time scales is defined by three 
different stochastic processes, reproduces power law in the asymptotic of log returns of 
the Dow- Jones Index and long memory in the volatility fluctuations. 

In the work of J. -P. Fouque at al [19J (see also [20]) volatility dynamics both at short 
(a few days) and long (few a months) time scales was considered within the scope of 
the ExpOU model but with different relaxation times for each scale. For S&P 500 high- 
frequency data the short time scale has been found of order ~ 1.5 days. 

In this work we want to check the application of the above SV models separately both 
at short and long the time scales. If at the specific time scale the Markovian approximation 
is applicable, then the coefficients of the SDE of Ito written as 




(6) 
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can be obtained from the known expression of the theory of Markovian processes [2T] 



Dk{x) = \im—— / dx'{x' — x)''p{x' ,t + T\x,t), (7) 

where t'\x, t) is the conditional PDF and Dk are the coefficients of the Kramers- Moyal 
expansion. Such approach has recently been used in [23] and allowed directly from 
the data to estimate the coefficients of SDE of Ito for returns handling high-frequency dy- 
namics of DEM/USD exchange rates. Here this method is applied to both high-frequency 
and low-frequency data. The SDKs of the form ([6]) obtained for this two data sets have to 
define the volatility behavior both at short and long time scales. In the end this gives an 
opportunity to make a comparison with the known SV models and, to a certain extent, to 
answer the question of how consistent one or the other model is with the empirical data. 

The paper is organized as follows. Section 2 is devoted to describing the method of 
determination of the volatility time series. In Section 3 the coefficients of the Kramers- 
Moyal expansion have been obtained for different time scales. In Section 4. the numerical 
solution of the Fokker-Plank equation for conditional PDF is given and the convergence of 
the solutions to the stationary distributions is considered for both time scales. In section 5 
on the basis of the obtained SDE for volatility the simulation of the return series is carried 
out and its properties are studied. The analysis of the results is given in conclusion. 



2. The estimation of volatility and data sets 

Unlike prices changes the volatility is not directly observed. At the present there are 
a different methods of its estimation (for example, see 12^). Most frequently the 
volatility at the moment of time t is estimated as the standard deviation 

1 t+T 

<^'t=7f^Y.in'-<n>)\ (8) 
^ t'=t 

t+T 
t'=t 

where rt = In St/ St-A are log-returns and an average is carried out over time window 
T = (A^ — 1)A with an integer A^. 

The two different data sets have been used for the empirical analysis: the high- 
frequently data set (HFD) for the Dow- Jones Index (data sampled at 5 min intervals 
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from Feb. 16, 2001 to Feb. 26, 200dll) and low-frequently data (LFD) (with daily data 
for the Dow- Jones Index from Jan. 2, 1990 to Feb. 25, 200q3). One has respectively 
A = 5 min, T=2 hours for HFD and A = 1 day, T=l month (21 days) for LFD. The 
non-overlapping intervals of averaging T have been used for calculation of the volatility 
given Eq.([H]) and respectively the sampling time interval for the volatility data equals T. 

The obtained empirical values at have been used for the construction of the stationary 
distributions of the volatility and the conditionals PDFs. In doing so it has been assumed 
that the volatility is a stationary process ^14j . 

3. The estimation of the Kramers-Moyal coefficients 

According to what was said in the introduction we shall consider the volatility on the 
above indicated time scales in the Markovian approximation. In this it is known, 

the conditional probability density obeys a master equation in the form of a Kramers- 
Moyal expansion |21] . 

d ( 

— p(a,t|ao,to) = (^-^j Dk{a)p{a,t\aQ,tQ) (9) 
where the coefficients are defined as 

' At->0 At ^ ' 

and moments Mk are 

Mk((T,At) = ^f id - (jfpia,t + At\a,t)dd (11) 
k\ J 

In this section we shall calculate the coefficients Di and D2 of the expansion ([9]) and 
show that with enough accuracy the data set allows to take to be zero. According to 
Pawla's theorem [2T] at = all coefficients with A; > 3 vanish and the equation ^ 
reduces to a Fokker-Plank equation. In this case it is coefficients Di and D2 that define 
the form of SDK for volatility of the form (j6]). 

For the calculation of the moments the conditional densities p(cr, t -|- At|(T, t) (see 
Fig. 4) have been determinated from the empirical data and the numerical integration in 



^http:/ /www. finam.ru 
^http: / /finance. yahoo. com 



(11) has been performed. Further the approximation of the hmiting passage At ^ in 
(10) has been employed and coefficients Di and D2 have been obtained. 

Fig.l shows some typical dependence of the moments Mi 2 of At. In order to obtain 
the moments Mi^2(o", At) for the case of the small At, the volatility given Eq.(l8]) has been 
calculated using the overlapping intervals T. At the small a the moments Mi 2 are well 
enough described by the linear dependence on At. Therefore the limit in (|TU|) has been 
approximated as follows 

M,(a,At) ^ M,(a,At) 
At^o At At ^ ' 

at At = T. At large a values of moments fluctuate drastically because of the decrease of 

the statistical data. Nevertheless here too limit fllOj) has been approximated by relation- 
ship dn]) at At = T. 

The results of the calculation of the coefficients Di^2{(^) given by Eqs. fllOtlTTi) are shown 
in Fig. 2. It has turned out that for both data sets Di{a) can be approximated well enough 
by the function that coincides in form with the drift coefficient of the ExpOU model (jl]) 
(Fig. 2a; 2c). For approximation of the coefficient D2{cr) for LFD the square dependence 
on a has been used (Fig. 2d). For HFD for the small cr, D2{(y) can also be approximated 
by the square dependence on cr, however, for large cr it increases faster then the square 
function. Therefor for approximation -D2(cr) for all a the function has been used 

D2{<y)=bia^eMb2<T). (13) 

In the result of the fitting we have obtained: 
for HFD 

D[''\a) = -a{a^-a2Ha/ao)) (14) 
Di"\a) =hia^e^^{h2a) (15) 

where ai = —0.071 (month)~^; 02 = 26.5 (month)^^; bi = 7.08 (month)"^; 62 = 2.65 (month)^ 
for LFD 

D^^\a) = -a{a,-a2ln{a/ao)) (16) 
Di''\a)=ha' (17) 

where ai = 4.47 (month) 02 = 0.41 (month)"^; bi = 0.06 (month)"^. For both data 
sets erg is the mean volatility equal to 0.044 (month)"^^^. Correspondingly, the relaxation 
times are l/a^^^ = 1/4^^ = 0.79 day and = l/af^ = 2 A3 month. 



At last, the results of the calculation of the coefficients are represented in Fig. 3. 
It is shown that the values of the coefficient D4 in fact are equal to zero, the fluctuations 
do not exceed 10~^ for HFD and 3 ■ 10^^ for LED which is a few orders less than the 
corresponding values of Di and D2- 

4. The numerical solution of the Fokker-Plank equation 

As it has been noted in the previous section at = the master equation (I9l) reduces 
to the Fokker-Plank equation 

d id 92 1 

— p(o-,toko,to) = { + ^^2(cr) p(cr,to|cro,^o) (18) 

In this section we shall consider on the basis of a numerical solution of Eq. (ITSl) the time 
evolution of the conditional densities p(cr, t|cro, to) and show that the stationary solution 
of this equation is consistent enough with the empirical densities. 

Eq.f llSI) was solved with at boundary conditions at cr = and a = oo 

p(a,to|o"o,to)U=o;+oo = ((T > 0) 

and the initial condition p((T, tol^O) ^o) = ^(o" — o"o) where 5{x) is 5-function and ctq = 0.044. 
For the numerical solution of the Fokker-Plank equation (ITSl) the finite-difference method 
given in [22] has been used. 

The conditional densities p(o", t|o"o, to) for different times t are represented in Fig.4. As 
it is shown from Fig.4a for high-frequency data the stationary state is reached within t ~ 
1.5 days (from this on time the theoretical curves practically coincide). For low-frequency 
data the stationary state is settling within the time of the order of 5 months (Fig.4c). As 
it is shown from Fig.4b;4c the theoretical stationary distributions are consistent enough 
with the empirical volatility densities. To some extent this fact can serve as validation of 
estimating the coefficients Di 2- 
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5. The simulation of the return series 



As it is known fTfi [2T] the Fokker-Plank equation ( JTSll is equivalent to the SDE of Ito of 
the form 



This equation in combination with Eq. ([T]) enables to perform the simulation of the prices 
series that gives an opportunity to obtain a theoretical return PDF. 

In order to eliminate the parameter /i in Eq.([T]) let us introduce the new variable 
Xt = InSt/So — fit, where 5*0 is the initial price. It is easy to obtain that 



Using Eqs.f llQI) and ( !20l) and the explicit form of the coefficients Di and D2 for both 
data sets we have generated the series Xt. The Wiener processes Wi{t) and W2(t) have 
been assumed to be independent. The found price series have been used for the plotting 
of the probability density and the autocorrelation function of the absolute log-returns. 

Fig. 5 represents the plots of PDF of the prices changes Ax = In St/ St- a — /uA obtained 
from both the generated price series and the empirical data. As it is seen there is a good 
agreement between the corresponding curves. 

The plots of the autocorrelation function of the absolute log-returns \rt\ are given 
in Fig.6. In the case of high-frequency dynamics (Fig. 6a) there is a rapid decay of the 
empirical autocorrelation function at the time of the order of one day followed by a more 
slow decrease (solid line). The generated curves reproduces this abrupt drop (dashed line) 
at time of the order of 1.5 days. The same behavior is also exhibited for S&P500 Index 
|19j . The periodic oscillations of the empirical correlation function arises from a stable 
increase of the trade activity at both the beginning and the end of day. 

In the case of low-frequency dynamics the generated autocorrelation function repro- 
duces the initial drop of the empirical curve at times of the order of two months (Fig. 6b). 

6. Conclusion 

The SV models introduced in [H El [HI H] define the volatility as a Markovian process. 
On the other hand the volatility autocorrelation function shows the existence of two or 




(19) 



dxt = -dt + atdWi{t). 



(20) 



9 



more characteristic times, which, in general, is not typical for the Markovian processes. 
In recent works [ISl IIHI ED] the SV models have been considered, describing volatihty as 
a superposition of Markovian processes with different characteristic times. Using these 
approaches we consider volatility at both short and long time scales in the Markovian 
approximation. On the basis of the empirical data, employing Eqs. fllOl) and fill I) , we 
estimated the coefficients D12 of Ito SDE defining the volatility dynamics. It has been 
shown that for the long time scale the empirical data support the ExpOU model with the 
characteristic time = 2.43 months (see Eqs. (|T6l) and ( fTTl) ). On the short time scale 
the drift coefficient can also be described within the scope of this model with = 0.79 
days (Eq.l fT^ ). As regards the diffusion coefficient it shows more complicated behavior 
than a simple square dependence (see Eq.( |T5i) ). 

On the base of the numerical solution of the Fokker-Plank equation we have considered 
the time evolution of the conditional PDF p(cr, t|cro, to)- If has been shown that the 
stationary state settles in accordance with the found relaxation times within time of the 
order t ~ 2/a, where for the long time scale ~ 5 months and for short time scale 
^5 ~ 1, 5 days. Very good agreement is found between the calculated stationary densities 
and the empirical curves (Fig. 4). This fact supports the validity of estimating of the 
coefficients -Di,2- 

On the basis of Eqs (1191) and (1201) the price series Xt has been generated and PDFs of 
the price changes Axt = Xf — Xt-A for the different time delays A obtained. In the absence 
of a correlation between the Wiener processes Wi{t) and W2{t) the agreement between 
the simulated and empirical densities for both time scales is good enough (Fig. 5). 

On the basis of the generated data the volatility autocorrelation function has been 
obtained. The empirical autocorrelation function within the interval of a few months 
shows the existence of at least two characteristic time scales (Fig. 6). At the initial stage 
there is a drop within approximately 0.5 days followed by a slow decay. As it is seen from 
Fig. 6 the generated data at both short (within ~ 0.5 days) and long (within ~ 2 months) 
time scale separately describe such behavior. 

The obtained results deserve attention especially if one takes into account that the 
parameters D12 were not fitted specially to neither the empirical densities nor the behavior 
of autocorrelation function. 
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The parameters of the ExpOU model for the Dow- Jones Index were estimated in 
other works also. In [14] it has been shown that the fitting of the theoretical curve to 
the empirical volatility autocorrelation function at the interval of a few months gives the 
estimation of the relaxation time of the order of 35 ± 18 days. On the upper bound this 
estimation is close to our results for LFD. 

On the other hand the derivation of SDE for the volatility on the basis of the empirical 
data was also considered in [26] . The equation obtained in this work for the low-frequency 
dynamics of the Dow- Jones Index is close enough to the Eq.(jl]) of the ExpOU model. 
Numerical solution the the Fokker-Plank equation for the conditional PDF has shown 
that the stationary state settles within 3-4 months, which approximately corresponds to 
our data. 

Thus the results reported in this work show that the employment of the Markovian 
approximation at the individual time scale in all probability allows to describe at this 
scale a number of market appropriateness. In particular we have shown that the volatility 
autocorrelation function and the probability return densities obtained within the scope 
of this approximation are consistent enough with their empirical analogues separately at 
both short and long time scales. 
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Figure 1: The typical dependence of Mi 2(0", At) on At at small a (solid line). Circles 
correspond to the quantities obtained from Eq. ffTT]) . Figs, la, lb correspond to HFD set at 
a = 0.032; Figs. Ic, Id correspond to LFD set at cr = 0.02. 
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Figure 2: The coefficients Di^2{o', t) obtained from the empirical data using Eqs. ( iTTl) and 
ffT^ (circles). The solid line is the approximation by the smooth curve. Figs. 2a and 2b 
correspond to HFD set; Figs. 2c and 2d correspond to LFD set. The dashed line in Fig. 
2b shows the square dependence of D2 on a. 
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Figure 3: The dependence of the coefficient the on a ; (a) - HFD set; (b) - LED set. 
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Figure 4: The time evolution of the conditional PDFs p{at\aoto); (a) - HFD set, t — 
0.1; 1.5; 10; 12 days; (c) - LFD set, t = 0.5; 2; 5; 12 months; to = 0. The solid line in 
Figs. 4b and 4d correspond the stationary distributions obtained from the Fokker-Plank 

equation, respectively, for t = 10 days (HFD) and t = 12 months (LFD). Circles - the 
empirical distributions. The inset shows the same curves in the log - log scale. 
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Figure 5: The probability densities of the price changes Ax = In St/ St-A — /^A. The 
sohd hne have been obtained from the simulated data; circles - the empirical data; (a) - 
HFD set, A = 0.5; 2; 6.5 hours; (b) - LED set, A = 6; 24; 72 days. For convenience of 
presentation the PDFs are shifted in vertical direction by multiplication by 10. 
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Figure 6: The autocorrelation function of \rt\. The dashed line have been obtained from 
the generated data, the solid line obtained on the basis the empirical data; (a) - HFD set, 
(b) - LFD set. 
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